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Bottlenecks in Design 


• Simulations for design are at least an order of 
magnitude more challenging than their 
counterparts in analysis due to time constraints 

• Bottlenecks exist in all aspects of simulation 

- Setup (model manipulation and grid generation) 

- Solution (accuracy and solution time) 

- Post-processing (visualization, engineering figures, etc.) 

• Our R&D has been focusing on how to break 
some of these bottlenecks through 

- Automation 

- Large-scale parallelization 

- Advanced modeling 
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Example Areas 


• Examples of bottlenecks for simulations 

used in design: 

- Complex flow paths such as secondary-air and 
cooling systems 

- Complex aerothermal problems such as film-cooled, 
high-pressure turbines 

- Complex flows with mixing such as 
secondary/endwall flows 

• Similar bottlenecks exist in other Air Force, 

Navy, and NASA design problems 

- Our intent is to show new ideas that might be used in 
a broad range of applications 
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IDEA 


• Instead of generating a computational grid 
after the model is created, why not generate 
it at the same time and take advantage of the 
CAD operations??? 
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CAD-Based Grid Generation 


--j 


For each CAD primitive, 
construct a grid inside and 
outside 

When CAD operations are 
performed to construct 
model, apply them to the 
computational grids as 
well 

CAD operations consist of 

- Union 

- Intersection 

- Difference 




(a) primitive B is a box and C is a circle 



(c) B — C 



(b) BUC 



(d) B(1C 
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Application to Gas-Turbines 




Secondary 

CAD Model 


-Air 


System Flow Path 

Resulting Grid 


• For more details: AIAA-2009-3991 
“Automated Creation of Overset Grids 
Directly from Solid-Model Feature Trees” 
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IDEA 


• Why not use similar embedded, overset grid 
technology for multi-disciplinary problems 
such as conjugate heat transfer of film-cooled 
turbines??? 
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Automated Model/Grid Generation of 
Turbine Airfoil Sections 

* Conjugate heat transfer of various HPTs including 
TBC, cooling holes, walls, and plenums 


O 



Extension to 3D is 
relatively 
straightforward! 


Overset film cooling 
holes grids can be 
changed for 
optimization without 
affecting remainder 
of grid 


Every 4 th grid point shown for clarity 

Completely General: Can also be used for flow control 
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Stagnation Temperature Contours 

* Time-averaged Detached-Eddy Simulations are the Norm! 

- Enables effects of self-excited unsteadiness on aerothermal performance to be 
predicted 

- Shown here: Instantaneous DES results 



'32.000 


Trailing-edge 
shedding effects 
on thermal mixing 
predicted 



1 1 10.000 


50.000 


Sheddina subdued 

^ - 

with trailing-edge 
blowing 
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Temperature Contours 



Cooling flow “lift- 
off” can easily be 
seen 


Region of cooling 
effectiveness can 
easily be seen 


* Interaction effects on both flow and wall temperature 
distributions are shown 


Effect of 
cooling flow 
on wall _ 
temperature 
can be 
easily seen 


■ 0.030 


Thermal barrier 
coating included 


Wall hot spots can 
be easily seen 
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Wall Temperature Distribution 


• Wall temperature distributions show extent of 
cooling effectiveness 



X/Bx 
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For More Details 


• See: 

AIAA-2010- “Conjugate Design/Analysis 
Procedure for Film-Cooled Turbine Airfoil 
Sections” to be published January 2010 

AIAA-2009-913 “A Conjugate Heat Transfer 
RANS/DES Simulation Procedure” 

AIAA-2008-4407 “Transitional RANS/DES 
Computations of Turbine Heat Transfer” 

AIAA-2008-534 “A Detached-Eddy Simulation 
Procedure Targeted for Design” 
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IDEA 


• Why not use the latest parallel computing 
technology to turn around complex 
multi-disciplinary solutions fast enough for 
automated design optimization??? 
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Let’s Use GPUs with CPUs! 

• Graphical processing units (GPUs) have 
proven success for gaming applications 

• We have recently shown GPUs to also be 
useful for scientific simulations 

• GPU Costs: 

— $500 for 128 floating-point units 

- Example : Our GPU cluster in ECE 

• 8 nodes of single quad-cores (32 cores) 

• 1 GPU per core -> 32 GPUs 

• 12 Teraflops of peak performance, ~$25,000-$30,000 

- Low space and power requirements 

1 

• Cost Effective Means of Achieving our Goals! 
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Our Latest Results 

* Argonne National Laboratories 32 CPU/GPU cluster 

GPU Cluster Speedup : MBFL02 





Domain 

Size 


-♦-GPU (800K) 
-■-CPU (800K) 
-A-GPU (3.2M) 
— H-CPU (3.2M) 
ideal GPU 
—•—ideal CPU 


Number of GPU /CPU 
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Summary of Performance Gains 

• For 32 CPU/GPU combined processors 

- MBFLO speed-up increased to -146 over single CPU 
(66% improvement over 16 combined processors) on 
Argonne National Laboratory cluster. 

• A factor of 5.4 over 32 CPUs at 85% efficiency. 

• Over a factor of 30 in performance/price ratio! 

oo 

• Greater speed-ups with larger data sets and 
more computing 

- Great performance for 3D turbulent computations! 

• Goal to turn-around conjugate heat-transfer 
DES simulations in a few minutes so that 
automated optimization can be used 

- Currently at -30 minutes solution time for TA’d DES 
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For More Details 



VO 


AIAA-2009-565 “Rapid Aerodynamic 
Performance Prediction on a Cluster of 
Graphical Processing Units” 
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Overall Summary 


* New ideas are forthcoming to break existing 
bottlenecks in using CFD during design 

- CAD-based automated grid generation 

- Multi-disciplinary use of embedded, overset grids to eliminate 
complex gridding problems 

- Use of time-averaged detached-eddy simulations as norm 
instead of “steady” RANS to include effects of self-excited 
unsteadiness 

- Combined GPU/Core parallel computing to provide over an order 
of magnitude increase in performance/price ratio 

* Gas-turbine applications are shown here but these 
ideas can be used for other Air Force, Navy, and 
NASA applications 


